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Abstract 

Experiments measuring contact formation between a probe and quencher in disordered chains 
provide information on the fundamental dynamical timescales relevant to protein folding, but their 
interpretation usually relies on simplified one-dimensional (ID) diffusion models. Here, we use 
all-atom molecular simulations to capture both the time-scales of contact formation, as well as the 
scaling with the length of the peptide for tryptophan triplet quenching experiments. Capturing 
the experimental quenching times depends on the water viscosity, but more importantly on the 
configurational space explored by the chain. We also show that very similar results are obtained 
from Szabo-Schulten-Schulten theory applied to a ID diffusion model derived from the simulations, 
supporting the validity of such models. However, we also find a significant reduction in diffusivity 
at small separations, those which are most important in determining the quenching rate. 
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Characterizing the configuration distribution and dynamics within unfolded or disordered 
peptides is a first step toward understanding more complex processes such as protein folding 
and aggregation [Tj. To this end, contact quenching pump-probe experiments are a sensitive 
measure of dynamics in disordered peptides, which can be used to determine loop formation 
rates M, helix-coil dynamics and even the folding rate of small proteins [8j. In these 
experiments, a probe is initially excited to a long-lived electronic state which can be quenched 
by contact with a second species distant in sequence, allowing the chain dynamics to be 
monitored. However, interpretation of the data usually requires fairly strong assumptions 
about the nature of the probe-quencher distance distribution and dynamics; inclusion of 
additional experimental data such as Forster resonance energy transfer (FRET) efficiencies 
can help to constrain the distance distributions [;Qj. 

An alternative approach to interpreting contact formation experiments is to use molec¬ 
ular simulations to compute quenching rates directly m. requiring only the knowledge of 
the distance dependence of the contact quenching rate. In principle, these can provide a de¬ 
tailed view of the chain dynamics, without the need for simplifying assumptions. Previous 
insightful work using atomistic simulations has been used to interpret contact quenching 
rates in short disordered peptides. It was found that the rates obtained from simulation 
needed to be reduced by a factor of 2-3 in order to match experiment, which was attributed 
to the viscosity of the water model being too low in the simulations ra. However, this 
also assumes that all rates, including the quenching rate, are slowed by the same viscosity 
factor, which may not be realistic. In addition, the interpretation is complicated by the 
over-collapsed nature of the disordered ensemble, relative to estimates from experimental 
measurements such as FRET, SAXS or light scattering for many atomistic force fields m 
fT5] . Recent physically motivated refinements of protein force fields, have been shown to yield 
more accurate equilibrium properties for disordered chains mm, and should not require 
any correction for viscosity, owing to the use of more accurate water models mm- These 
should alleviate the need for adjustments to the data, and allow a more direct interpretation 
of the experimental results. 

Here, we focus on a set of experiments in which the dynamics of a series of peptides 
of composition C(AGQ) n W-NH 2 (hereafter: AGQ n ) was monitored from the rate at which 
the triplet state of the tryptophan (W) at one end of the chain was quenched by van der 
Waals contact with the cysteine residue (C) at the other end j3H5|. The experiment is 
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illustrated by the kinetic scheme in Fig. [lj Briefly, after optical excitation, the termini of 
the peptide will diffuse relative to each other, and may be quenched on contact. In the 
extreme “diffusion-limited” scenario, the quenching on contact is so fast that the observed 
rate of triplet quenching k 0 b s is just the diffusion-limited rate of contact formation, ku+- 
In the opposite “reaction-limited” extreme, quenching is very slow and the termini must 
contact many times on average before a quenching event occurs, in which case the overall 
quenching rate depends only on the population of the contact states. The actual rate of 
quenching is usually somewhere between these scenarios, and so contains information on 
both the distance distribution and the dynamics of quenching. 

We have carried out extensive molecular dynamics simulations of a series of AGQ n pep¬ 
tides, for n — 1 — 6, using three related force fields: the Amber ff03* protein force field 
[16, T7j together with the TIP3P water model[T8]: the Amber ff03w protein force field [T9] , 
which is used in combination with a more accurate water model, TIP4P/2005 [15]; and the 
Amber ff03ws protein force field [13] which also uses TIP4P/2005 water, but with strength¬ 
ened protein water interactions. Specifically, in ff03ws the values of Lennard-Jones e for all 
protein-water atom pairs are scaled by a factor 1.10 relative to the standard combination 
rule, in order to correct the overly-collapsed nature of the disordered ensemble [13]. The 
simulations were run using Gromacs 4.5 or 4.6 IZSJ at a constant pressure of 1 bar and a 
constant temperature of 293 K for a total time of 2-10 /rs for each peptide and force field. 
Initial conditions were obtained either from short temperature replica exchange simulations, 
or from high-temperature runs at constant volume (see electronic supplementary information 
(ESI) for full details). 

In order to compare our results directly with experiment, we compute quenching rates 
using a step function for the dependence of the quenching rate q on the Trp-Cys separa- 
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FIG. 1. Kinetic scheme for triplet quenching: After initial laser excitation of the tryptophan to 
the triplet state, the terminal residues may come into contact with a diffusion-controlled rate kr>+- 
The triplet is quenched in contact with rate q. The termini may also separate with rate kp-. 
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FIG. 2. Decays of tryptophan triplet state. Overall decays calculated from simulations with the 
ff03* (black), ff03w (red), and ff03ws (green) force fields are shown for peptides AGQ n for n = 1 — 6, 
together with the corresponding experimental decays (broken purple lines). Symbols: simulation 
data; lines: single exponential fits to data. 

tion r cw , that is q(r cw ) = q c H(r c — r cw ), where q c = 8 x 10 8 s^ 1 is the constant quenching 
rate in contact, H(x) is the Heaviside step function and r c = 0.4 nm is the contact dis¬ 
tance. The distance r cw is taken as the minimum distance between the sulfur in the cysteine 
side-chain and the heavy atoms of the tryptophan indole ring system |3], [TO]. The ob¬ 
served quenching rate is then determined from the decay of the triplet survival probability 
S(t) = (exp[— f*° +t q(r cw (t'))dt'])t 0 , where the average is over equilibrium initial conditions 
t 0 , obtained by taking every saved frame of the simulation as a valid starting point. 

Overall decay curves for the triplet population determined using the step function form 
for q(r cw ) are shown in Figure |2j compared with experimental decays [4J. The simulation 
data for Amber fT03ws is in excellent agreement with the experiment for all n values except 
n — 1. As expected, there are large differences amongst the force fields, with quenching rates 
for ff03ws being significantly slower than for those ff03* and ff03w. Part of the difference 
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between ff03* and ff03ws is expected to be the ~ 3-fold lower viscosity of the TIP3P water 
model relative to TIP4P/2005 (the latter being very close to the true value) [ID.]. However, 
this is clearly not the only effect, since the decay for the ff03w force held, which also uses 
TIP4P/2005 water, is only slightly slower than that for ff03*. Therefore, the change in 
equilibrium conformational distribution from ff03w to ff03ws must also play a role in the 
observed difference. 


MD simulation 


Diffusion model 



FIG. 3. Dependence of quenching times (inverse of quenching rates) on chain length. Top, middle 
and bottom rows show the overall, reaction-limited and diffusion-limited quenching times, respec¬ 
tively. Simulation data for Amber ff03*, ff03w and ff03ws are shown by black, red and green 
symbols, respectively. Filled and empty symbols are for step-function and exponential distance de- 
pendence respectively. Experimental data are shown by purple symbols, and n h scaling expected 
for a Gaussian chain by the broken line. Left panels are rates calculated directly from simulation 
and right panels are those calculated from ID diffusion model using SSS theory. 
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We summarize the peptide-length dependence of the observed quenching rate in Figure [3} 
This confirms that the observed rate is in excellent agreement with the experimental data for 
ff03ws, while at the opposite extreme, ff03* results in rates which are almost independent of 
peptide length. The ff03ws results approximately follow an n^ 2 dependence of the reaction- 
limited quenching time on the number of peptide bonds rib, as expected for a Gaussian chain 
The power law which best fits the data is n l h ' iS±0A2 which is also in agreement with the 
trend in experiment toward rq/ 2 for longer AGQ sequences |4], as well with the fit to data 
for a different peptide sequence (n^' 36±0 ' 26 ) [2]. Interestingly, these quenching rates exhibit 
a very different scaling compared with loop formation rates in single stranded DNA |21| . 

We can obtain more insight into the contributions to the observed relaxation rate by 
splitting it into diffusion-controlled and reaction-controlled parts [1], via k~^ s = k^]_ + k R ] . 
We determine the reaction-limited k r by integrating over the Trp-Cys distance distri¬ 
bution, k R = J 0 °° q(r cw )P(r cw )dr cw . The diffusion limited rate can be obtained by us¬ 
ing a step function for the survival probability S(t) and averaging over time origins to, 
S(t) = ( H(t c (t 0 ) — t — to))t 0 . Here, t c {t 0 ) is the first time after to when the Trp and Cys con¬ 
tact, H(x) is again the Heaviside step function, and all time points in the simulation where 
the probes are not already in contact are used as separate time origins to. This calculation 
(Figure [3j reveals that for all of the peptides, the observed rates are in fact much closer 
to the reaction-limited rates, although there is a non-negligible contribution from diffusion. 
Interestingly, the slowdown in the diffusion-limited rate from ff03* to ff03w (by changing 
from TIP3P to TIP4P/2005) is very close to the 2-3 fold expected from the change of water 
viscosity. There is an additional slowdown in the diffusion limited rate when moving from 
ff03w to ff03ws, which presumably arises from the larger configurational space which must 
be explored due to the more expanded chain [T3]. In summary, it is clear that most of the 
improved agreement with experiment which we obtain by using Amber ff03ws comes from 
the reaction-limited rate. 

In the above analysis, we have assumed a very simple distance dependence of the rate of 
quenching. As an alternative, assuming that the quenching occurs via an electron transfer 
mechanism ra, we use an exponential distance-dependent rate g(r cw ) = k 0 exp[/3(r cw — r c )] 
where r c is the same and we have fitted the parameters ko = 1 x 10 8 s -1 and /3 = 33.33 
nnr 1 to bimolecular quenching data for tryptophan and cysteine embedded in a glass [22] 
(see ESI for full details). The resulting rates, shown by empty symbols in Fig. [3j are slightly 
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FIG. 4. Distribution of Trp-Cys distance for each chain length for AGQ n peptides, for three 
force fields. Symbols show the mean distance and curves are power law fits. 

slower, but generally in good agreement with those obtained from the step function distance 
dependence, suggesting that our conclusions are not overly sensitive to the particular form 
used for the rate. Considering the extremely sharp distance-dependence of the quenching 
rate, it is quite reasonable that the step function can be a good approximation. 

To understand the relationship between the chain dynamics and its structural properties, 
we characterize the equilibrium ensemble of conformations sampled in these simulations by 
the distribution of distances between the tryptophan and cysteine, shown in Figure [IJ These 
reveal distinct differences amongst the different force fields, which become more apparent as 
the number of AGQ repeats is increased: the simulations with Amber ff03* and ff03w tend 
to be quite collapsed, while those with ff03ws are relatively expanded. Notably, the mode of 
the distance distributions for ff03* hardly shifts as a function of chain length n, remaining 
near ~ 1 nm. For ff03w, weak expansion of the chain as a function of n from ~ 1 to 1.5 nm 
is observed. In contrast, AGQ n expands with n for the ff03ws force field as expected for a 
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chain in good solvent. 

We have quantified the polymer scaling properties of AGQ n peptides by fitting the de¬ 
pendence of the mean Trp-Cys distance on the number of peptide bonds to a power 
law r cw = An^ (similar results are obtained using the end-to-end distance), with A fixed to 
0.6 nm for all peptides. The exponents of 0.30 (0.02) and 0.36 (0.01) for ff03* and ff03w 
respectively are indicative of a chain in poor solvent [23], while the ff03ws exponent of 0.47 
(0.01) is close to the average exponent of 0.46 (0.05) determined experimentally for unfolded 
and disordered proteins [22]- The trends for the reaction limited rates are consistent with 
the equilibrium distance distributions, with the collapsed ff03* and ff03w being very similar 
to one another, and relatively independent of chain length. An important difference be¬ 
tween these two force fields is that the reaction-limited rates for ff03* even slightly increase 
with rib, which is not expected. Lastly, we note that an important distinction relative to 
the distributions frequently assumed in interpreting experiments [3 IS], is the existence of 
a separate short-range peak for the contact population in Fig. [4] The relative orientation 
of the Trp and Cys appears to be broad with no strongly preferred interaction modes (see 
ESI). The lifetime of this population is 0.4-0.9 ns for the ff03ws force field, depending on 
the peptide. 

Next, we test an approximation commonly used to analyze experimental data on contact 
formation, namely that the dynamics of the chain can be approximated as one-dimensional 
diffusion along the Trp-Cys distance coordinate. To investigate the accuracy of such a 
model for capturing the dynamics in the full phase space, we have fitted our simulation data 
r cw (t) to a ID diffusion model, using an established Bayesian approach ]26l[29]. Briefly, the 
method attempts to find the diffusive model, defined by a potential of mean force F(r cw ) = 
— lnpeq(' r cw) and position-dependent diffusion coefficients D(r cw ), whose propagators best 
match the observed history of the simulations (details in ESI). The diffusion coefficients thus 
obtained are shown in Figure [5] for ff03ws as a function of the number of AGQ repeats n in 
the peptides. 

The diffusion coefficients we estimate are very comparable to those obtained for the 
same peptide from direct analysis of contact quenching data, ~ 0.2 nm 2 ns _1 [I], and from 
MD simulations, 0.3 — 0.9 nm 2 ns~ 1 [T0] (after considering the low viscosity of the TIP3P 
water model used), as well as with diffusion coefficients estimated for an unfolded protein 
from single molecule FRET in water ~ 0.1 nm 2 ns _1 [30]. However, our analysis reveals 
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FIG. 5. Position-dependent diffusion coefficients D(r cw ), for each peptide AGQ n , n = 1 — 6. Error 
bars calculated from division of data into 5 non-overlapping blocks. Vertical lines indicate the 
backbone extension for a fully extended chain. Empty symbols for n = 3 are results from a 5 nm 
simulation box (vs 4 nm for solid symbols). Horizontal line is constant diffusion coefficient D cons t 
needed to fit the data for n = 3 — 6. 

a significant distance-dependence to the diffusion coefficient not included in prior work. 
Specifically, the diffusion coefficients vary relatively little at large separations, but strongly 
decrease at short probe-quencher distances, most likely due to the increased chain density 
at small distances, as well as hydrodynamic effects as the Trp and Cys approach each 
other. Remarkably, the D(r cw ) curves are nearly superimposable for short and intermediate 
separations r cw of Trp and Cys, for all of the peptides. Each peptide deviates from this 
common curve only when it approaches its maximum extension (vertical broken lines in Fig. 
[5]). This is not a finite size effect, as we obtain almost identical results for n = 3 with a 
larger simulation box (Fig. [5]). 

Although we have been able to determine a “best-fit” one-dimensional model, this by itself 
does not guarantee that the dynamics of this model is faithful to that of the full simulation 
(projected onto the same coordinate). To check this, we have used Szabo-Schulten-Schulten 
(SSS) theory [ST] to compute rates from the diffusion model for all simulations and compare 
them with those computed without dynamical approximations from the simulations (details 
in ESI). The results, shown in Figure [3] are in excellent agreement with the direct analysis 
of the simulations. We can also try to estimate the effective constant diffusion coefficient 
A^nst which would be required to match the measured diffusion-limited quenching rates. 
For n — 1 — 2, we obtain D const ~ 0.3 nm 2 ns^ 1 , and for n — 3 — 6, D const ~ 0.15 nm 2 ns^ 1 


9 













(the latter value in close agreement with the experimental estimate Jf| of ~ 0.17 nm 2 ns -1 ). 
This is also expected from ESI eq. 2 as the D{r ) at short separations are effectively weighted 
much more, thereby helping to rationalize the experimental observation that the diffusion 
coefficient for describing contact formation is about an order of magnitude smaller than the 
relative bimolecular Trp-Cys diffusion coefficient pE|. 

Our results indicate that treating contact formation using simple ID diffusion models 
captures the relevant dynamics accurately, justifying the use of such models in interpreting 
experiment. This is a remarkable result because there are many situations where the end-end 
distance is in fact not a good reaction coordinate, except in the presence of a mechanical 
pulling force [32j [33]. However, the simulation results do suggest additional complexity 
beyond what could reasonably be assumed a priori when interpreting the experimental 
data: namely that the distance distribution functions P(r cw ) include an additional peak 
at short separations corresponding to the contacting residues, and the diffusion coefficients 
D(r cw ) exhibit a strong distance-dependence at the short separations, which are most im¬ 
portant for determining the diffusion-limited rate of contact formation: indeed the effective 
position-independent diffusion coefficients obtained by fitting SSS theory to experimental 
quenching rates would be almost entirely determined by the diffusion coefficients at the 
shortest probe-quencher separations. These results should aid in the interpretation of future 
contact quenching experiments, as well as the many other types of experiment monitoring 
a single residue-residue distance, which are often also modeled as using ID diffusion: these 
include single molecule FRET, optical tweezers, and atomic force microscopy experiments. 
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